BioHEL: Bioinformatics-oriented Hierarchical Evolutionary Learning
نویسنده
چکیده
This technical report briefly describes our recent work in the iterative rule learning approach (IRL) of evolutionary learning/genetics-based machine learning. This approach was initiated by the SIA system [12] A more recent example is HIDER [1]. Our approach integrates some of the main characteristics of GAssist [4], a system belonging to the Pittsburgh approach of Evolutionary Learning, into the general framework of IRL. Our aims in developing this system are use all the good characteristics of GAssist but at the same time overcome some of the scalability limitations that it presents. The document is splitted in five parts, knowledge representation, general workflow, fitness function, some illustrative results and further work. The system has the following characteristics: • Each individual is a single rule, and each GA run learns one rule at a time. • In order to learn all the rules of a domain this process is applied iteratively. At the end of each iteration the examples covered by the rule that have been learned are discarded from the training set. In this way, the GA is forced to explore other areas of the search space.
منابع مشابه
Speeding up the BioHEL evolutionary learning system using GPGPUs
The BioHEL system is an evolutionary learning system designed to cope with large-scale datasets. This system have several characteristics focused on tackling this kind of problems, such as special representation to determine the relevant attributes in a rule, the usage of a windowing system, among others. Recently, we have extended the system to perform the rule evaluation process inside NVIDIA...
متن کاملIntegrating memetic search into the BioHEL evolutionary learning system for large-scale datasets
Local search methods are widely used to improve the performance of evolutionary computation algorithms in all kinds of domains. Employing advanced and efficient exploration mechanisms becomes crucial in complex and very large (in terms of search space) problems, such as when employing evolutionary algorithms to large-scale data mining tasks. Recently, the GAssist Pittsburgh evolutionary learnin...
متن کاملAnalysis of mass spectrometry data from the secretome of an explant model of articular cartilage exposed to pro-inflammatory and anti-inflammatory stimuli using machine learning
BACKGROUND Osteoarthritis (OA) is an inflammatory disease of synovial joints involving the loss and degeneration of articular cartilage. The gold standard for evaluating cartilage loss in OA is the measurement of joint space width on standard radiographs. However, in most cases the diagnosis is made well after the onset of the disease, when the symptoms are well established. Identification of e...
متن کاملImproving the scalability of rule-based evolutionary learning
Evolutionary learning techniques are comparable in accuracy with other learning methods such as Bayesian Learning, SVM, etc. These techniques often produce more interpretable knowledge than e.g. SVM; however, efficiency is a significant drawback. This paper presents a new representation motivated by our observations that Bioinformatics and Systems Biology often give rise to very large-scale dat...
متن کاملSpecial issue on advances in Learning Classifier Systems
Learning Classifier Systems (LCS) constitute a uniquely adaptable class of learning framework existing at the intersection of machine learning and evolutionary computation. Fundamentally, an LCS combines genetic search with an appropriate learning strategy to evolve a rule set which collectively describes a temporal or spatial problem. Since their conceptualization, a variety of algorithmic arc...
متن کامل